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COMB 1 NATO K I AL  OPTIMIZATION: 


WHAT  IS  THE  STATE  OF  THE  AKT? 


Victor  Kloo 


Department  of  Mathematics 
University  of  Washington 
Seattle,  Washington 


This  survey  attempts,  witliout  including  many  details  of 
algorithms  or  of  the  underlying  thc'ory,  to  answer  the  follow- 
ing questions:  What  is  combinatorial  optimization?  What  are 
the  landmarks  of  the  theorv'?  Kfiat  are  t tie  most  exciting  re- 
cent advances?  What  are  tlie  most  promising  directions  for 
research?  Wliat  iire  tlie  best  sources  of  furtlier  information? 


1.  WHAT  IS  COMFlINATOltl  Ah  OPTIMIZATION? 


Many  practical  problems,  especially  those  from  operations 
research  and  computei'  science,  are  concerned  witli  o(>timizing 
a real-valued  function  f over  a finite  set  x of  d-tuples 
of  integers.  Often  x is  not  presented  explicitly  but  is 
defined  implicitly  in  some  manni'r.  Wlii'n  f is  linear  and  x 


r 
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is  defined  by  a finite  number  of  linear  inequality  con- 
straints with  integer  coefficients,  the  problem  is  one  of 
(linear)  integer  programming.  Starting  witl>  tlie  work  of 
Gomory  (1958),  finite  algorithms  and  an  extensive  tlieory  have 
been  developed  for  sucli  problems.  See  Hu  (19C9),  Garfinkel 
and  Nemhauser  (1972),  Geoffrion  and  Marsten  (1972),  Halinski 
(1974)  and  Ilanuner  et  al.  (1977)  for  details  and  further  ref- 
erences. Computational  experience  wit)i  these  algorithms  lias 
been  mixed.  Some  fairly  large  probli'ins  have  been  solved,  but 
experience  often  shows,  even  with  problems  of  modest  size, 
that  an  algorithm  requiring  a very  large  finite  number  of 
steps  may  have  no  practical  advantage  over  one  requiring  an 
infinite  number. 

Most  of  combinatorial  optimization  deals  with  problems 
that  can  bo  formulated  as  integer  programs  but  have  an  undei — 
lying  combinatorial  structure  that  lends  itself  to  the 
development  of  special  algorithms.  One  naturally  hopes  that 
these  algorithms,  when  applicable,  will  be  more  efficient  than 
general  integei'  piograrami ng  algorithms.  Many  of  tlu'  prob- 
lems arc  associati'd  with  a directed  or  undirected  graph 
G = (N,A)  and  a function  a defined  on  the  (finite)  node- 
set N or  the  arc-set  A or  on  NnA  and  taking  values  in 

(G,a)  is  often  called  a network . It  is  assumed  for 
simplicity  that  G is  looploss,  so  that  A is  a sot  of 
ordered  or  unotdered  (according  as  G is  directed  or  not) 
pairs  of  distinct  nodes.  Arcs  may  be  written  as  ordered  pairs 
(i,j)  with  the  understanding  that  they  will  be  interpreted 


as  unordered  when  G is  undirected. 


A path  from  node  s to  node  t is  a sequence  of  arcs  of 


the  form 

with  Xp  = s and  Xj^  = t.  When  such  a path  exists,  t is 
accessible  from  s.  A path  (*)  is  a simple  path  if  there 
is  no  repetition  of  nodes  (other  than  that  implied  by  the 
notation)  . It  is  a circuit  if  “ ^0'  ^ simple  cir- 

cui t if  there  is  otherwise  no  repetition.  Note  that  simple 
paths  and  simple  circuits  may  conveniently  be  regarded 
as  sets  (rat)icr  tlian  sequences)  of  arcs. 

A graph  is  connected  if  each  node  t is  accessible  from 
from  each  node  s * t.  A tree  is  a connected  graph  that  con- 
tains no  circuit.  A spanning  subgrapli  is  one  that  uses  all 
nodes.  A matching  is  a set  of  arcs  no  two  of  which  have  a 
common  node.  A tour  is  a spanning  simple  circuit. 

For  purposes  of  illustration,  we  focus  here  on  five 
problems  of  network  optimization,  assuming  for  simplicity 
that  A is  the  domain  of  a and  (except  whore  the  contrary 
is  stated)  a ; 0.  The  graph  G should  be  undirected  in  (2) 
and  (4),  but  in  the  others  it  may  be  directed  or  undirected. 
In  all  but  (3),  the  desired  solution  may  be  regarded  as  a 
subset  of  A,  the  length  or  weight  of  such  a set  being  the 
sum  of  the  a-values  of  its  members.  (Thus  tl\e  objective 
function  is  a linear  function  of  arc  lengths.)  In  (3)  the 
values  of  a are  regarded  as  flow  capacities. 
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(1)  (Minimum  pAth)  For  two  cjivt“n  nodes  s and  t of  G, 
find  a shortest  path  from  s to  t or  conclude  that  t is 
not  accessible  from  s. 

(2)  (Minimum  spannituj  tree)  For  a connected  G,  find  a 
connected  spanning  subgraph  of  minimum  weight. 

(3)  (Maximum  flow)  For  two  given  nodes  s and  t of  G 
such  that  t is  accessible  from  s,  find  an  a-feasible 
flow  of  iiuiximum  value  from  s to  t. 

(4)  (Maximum  matcliing)  Among  all  matoliings  in  G,  find  one 
of  maximum  weight. 

(5)  (Minimum  tour)  Given  a tour  in  G,  find  a shortest 
tour . 

In  addition  to  tlie  many  practical  optimization  problems 
for  wliic)»  the  inmiediate  mathematical  model  is  one  of 
(1)  - (5),  many  other  problems  c.an  be  reduced  to  these  or  to 
a combination  of  tliem.  Ttie  reaiU'r  is  undoubtedly  faced, 
eacli  day,  witli  several  instances  of  (1).  Problem  (2)  arises 
in  the  constrvict ion  of  communication  networks.  Problem  (3) 
stems  from  an  attempt  to  evaluate  t)»c  capacity  of  the  Past- 
ern F.viropean  rail  ju'twork  to  svipport  a large-scale  conven- 
tional wai-  (see  Billera  and  Lucas,  1976);  it  and  its  relative, 
the  minimum-cost  flow  problem,  are  used  to  model  a variety  of 
transportat  ion  problems.  Though  algoritltmti  lor  (4)  are  per- 
h.aps  not  as  obviously  useful  as  those  for  (1)  - (3),  matching 
algoritlims  tiave  in  fact  been  applied  to  problems  of  personnel 
assignment,  pairing  of  macliiiu'd  parts,  scheduling  of  two- 
processor  systems,  v.irious  routing  problems  (see  Lawler  (1976) 
and  Itis  references  foi'  all  of  these),  evaluation  of  biomedical 
data  (Tanimoto,  1976),  and  finding  the  rank  of  a matrix 


(Anderson,  1975;  Klee  and  vnn  den  Driessche,  1977) . Problem 
(5)  is  a form  of  the  famous  traveling  salesman  problem,  whose 
applications  include  computer  wiring,  vehicle  routing,  and 
job  shop  scheduling  (Lenstra  and  Rinnooy  Kan,  1975) . 

For  the  many  direct  applications  of  (1)  - (4),  and  for 
the  reduction  of  other  problems  to  these,  the  best  single 
source  is  the  book  of  Lawler  (1976),  ConUiin^t^q^^l  Optimiza- 
^on:  Networks  and  Ma t r o i d s . It  will  bo  mentioned  frequently 
in  the  sequel.  The  books  or  survey  articles  of  Uerge  and 
Ghouila-llouri  (1965),  Bradley  (1975),  Busacker  and  Saaty 
(1964),  Dantzig  (1963),  Eisolt  and  von  Frajer  (1977), 
B;lmaqhraby  (1970),  Ford  and  Fulkerson  (1962),  Frank  and 
Frisclt  (1972),  Fulkerson  (1966),  Garfinkel  and  Nemhauser 
(1972),  Mu  (1969),  Iri  (1969),  Karp  (1975b),  and  Whitehouse 
(1973)  also  contain  much  useful  information  about  combinator- 
ial optimization.  For  proble'ms  reducible  to  (5)  the  basic 
references  are  Cook  (1970)  and  Karp  (1972,  1975a).  Aho 
et  al.  (1974)  contains  a good  introduction  to  this  fascinating 
subject,  and  by  far  the  most  complete  study  is  tlio  book  of 
Garey  and  Johnson  (1978?),  Computers  and  Int  ractabi 1 ity : A 
Gui^e  ^ the  Theory  of  NP-complet eness . 

Let  n = |n|  and  a = |a|,  the  numbers  of  nodes  and  arcs 
respectively.  When  convenient,  we  assume  N = (1,2, ...,n). 

In  each  of  (1)  - (2)  and  (4)  - (5),  the  desired  solution  may 
be  regarded  as  a subset  of  A or,  relative  to  a given  index- 
ing of  A,  as  an  ordered  a-tuple  of  O's  and  I's.  There 
are  2*  such  subsets,  but  many  may  be  quickly  excluded  by 
paying  some  attention  to  the  combinatorics  of  t'^e  problem, 
however,  even  .ifter  such  exclusion  and  even  for  moderate 


values  of  n,  explicit  investigation  of  all  the  remaining 
subsets  is  impossible  even  on  the  fastest  computer.  To  illus- 
trate this,  consider  the  case  in  which  a •»  0 and  the  graph 
G = (N,A)  is  undirected  and  complete,  so  that  A = n(n-l)/2. 

(1)  Shortest  paths  are  simple.  When  G is  complete  the 
number  of  simple  paths  from  s to  t is 

(2)  Kach  connected  spanning  subgraph  of  minimum  weight  is 

a tree.  When  G is  complete  the  number  of  spanning  trees  is 

n-2 

n 

(4)  When  G is  complete  there  are  [n/2\  arcs  in  each 

maxinuim  matching.  The  niunber  of  matchings  consisting  of  [n/2j 
arcs  is  when  n = 2k  and  k*^  when 

n = 2k+l. 

(5)  When  G is  complete  the  number  of  tours  (considered  as 
sets  of  edges)  is  (n-l)!/2. 

If  a computer  required  oiUy  10  seconds  (one-tenth  of 

a nanosecond)  to  investigate  a particular  spanning  tree,  the 
time  required  to  investigate  all  n^  ^ spanning  trees  vv-ould 

be  about  .01  seconds  for  n - 10,  54  hours  for  n = 15, 

5 1 4 

8.3'<10  years  for  n = 20,  and  4.5>'10  years  for  n = 25. 

This  explains  wliy  one  speaks  of  the  "combinator ial  explosion" 

in  connection  witli  such  problems,  and  why  an  algorithm  tliat  is 

merely  "finite"  may  be  useless  except  for  very  small  problems. 

The  goal  of  combinatorial  optimir.at  ion  is  to  find  algoritlims 

that  are  useful  even  for  very  large  problems. 


KHAT  IS  A coon  AI.COlU'l'llM.' 


Hv'W  is  t ho  si'Oi'C  ot  .«!>  aUiot  ithiu  to  bo  luo.isiu  Ovi?  Ono  moy 
Oi'm^Mio  till'  I'oi  t v.>i  lu.inoi-  ol  twv'  .iUjO'  1 1 Urns  tor  t ht'  s^imo  pi  ol>- 
l«‘m  on  .1  t «'w  inst.>t\oos  ot  si'i'oi.il  intoiost,  but  bow  lolovuut 
is  tb.it  to  t lio  i r I'i't  t Ol  m.UK'o  on  otlu-t  i nst  .moos?  Oiu'  m.iy  b.iso 
t bo  ooiin'.it  1 Si'n  iMi  .1  l.itui'  nui'.iboi  ot  inst.inoos  obost'ii  in  .i  lou- 
ul.it  Ol  i .iiulk'iu  t.isliion,  t'ut  tb.it  m.iy  I'o  vt'i  y o.Kponsivo  .in^i  yot 
not  piovuio  .1  loli.iblo  i nO  i o.-i  t 1 on  ot  ['ot  1 m iii.inoi'  on  inst.inoos 
tiuu-b  l.itui'i  tli.in  t lioso  ti‘stov.1.  Ut'i  i'  wt'  li.-ivo  usoC  (.itul  lu'iioo- 
ti'illi  will  uso)  t lu’  toiiiis  j^iobloiii  aiul  inst.uu,-^  in  t bo  sonso  I't 
Abo  I't  .1 1 . tl''.'-!).  K.iob  inst.moo  is  .issoo  i .it  o.l  witb  p.ii  t lou- 
l.it  nur.ioiio.il  .l.it.i  utul  .i  pioblom  is  t bo  ol.iss  of  .ill  inst.inoos 
of  .1  spov'il  lo.l  toim.  I'oi'  ox.uv.plo,  t lu'  m.ix  imum  in.itobiiuj  prob- 
loiii  is  t !u'  I'l.iss  I't  .ill  inst.inoos  ol  t bo  toim  i-ll. 

It  IS  oxpooti'.l,  ot  oinivso,  tb.it  l.ii.n'  inst.inoos  will  usu- 
.1 1 1 y bt‘  si'li  t'.l  iiu'it-  slowly  t b.in  siii.i  1 1 inst.inot's.  boi  pt  obloiiis 
ov’nooinuui  .1  .ii.ipb  C • IN, .A),  t bo  p.iinmotois.  ti  *•  |n|  .iiul 
ii  - I .A  I piovivlo  .1  n.itui.il  mi'.iiiuio  ot  t bo  si.'.o  k't  .tn  inst.tnoi'. 
Wbon  1 IS  .1  tunotion  ot  tbosi'  p.ii  .imt't  ors , .in  .lUn'i  1 1 bin  is 
s.iiv.1  to  bi'  ot  ttimo)  ooinploxity  O t i ( n , .1 ) 1 it  tboio  is  .i 
oonst.mt  o snob  tb.it  tor  .ill  C ^ tN,.Al  t bo  .lUjoritlim  lo- 
i-iuitos  .It  most  oitn,.i)  Ovniiput  .it  ion.i  1 stops.  Tbo  notion  of 
"stop"  must  bo  i nt  Ol  pt  I't  o.l  in  tt'rms  ot  an  .ippiopi  i.t  1 1'  iikkIoI  of 
oomput  .It  ion , aiul  toi  loK'v.iiioo  to  piaotioo  tbo  movlol  sboiiKl  bo 
b.isoil  ..in  tbo  i.iiulom  .noot'ss  opoi.it  ions  of  nu'.loin  oloo'tioiiio 
ov'iiipu  t o i s . Tbo  .lisoussion  boio  is  in  loiiiis  of  tbo  R.AM  iiiovlol 
of  l atuKim  .loooss  oomput  .it  ion  .losoribo.l  by  Cook  aiul  Rookbow 


(1973)  and  on  pp.  5-14  ot  Aho  ct  al.  (1974),  using  tlic  uni- 
form cost  criterion.  Without  too  mucli  distortion,  the  reader 
may  simply  inti-rpret  "stt.'’  as  a single  arithmetic  operation 
(addition,  mull ipl ication,  comparison,  etc). 

A gocxl  a 1 qor  i thin  is  one  tliat  is  polynomial  ly  bouii^ed--is 
of  complexity  Oin^’a'^)  for  some  p and  q.  And  of  course 
we'd  like  the  exponents  p and  q to  be  as  small  as  possi- 
ble. Tliis  notion,  popularized  by  f.'dmonds  (19f)5a)  and 
Cobtiam  (1965),  is  meaningful  in  theoretical  studies  and  also 
useful  in  practice.  Since  a s n^,  an  Oin^a'^)  algorithm 

is  also  0(n^’^^'’^).  However,  n and  a are  mentioned  sep- 

2 

arately  because  a is  much  less  than  n for  most  graphs 

arising  in  practice.  Note  tliat  if  an  algorithm  is  to  take 

advantage  of  the  sparseness  of  the  input  graph  G = (N,A)  in 

2 

order  to  achieve  performance  better  than  0(n  ),  then  G 
cannot  be  input  as  its  full  n^n  adjacency  matrix.  Instead, 
the  input  may  consist  of  lists  tliat  tell,  for  each  node 
i «.  N , which  other  nodes  j are  adjacent  to  i,  and  that 
also  give  the  arc  lengths  a(i,j). 

The  above  notion  of  comple.xity  involves  the  woist-case 
behavior  of  an  algorithm,  but  average-case  behavior  is  also 
important  in  many  applications.  For  this  notion  to  be  mean- 
ingful, the  sample  space  must  be  carefully  defined,  and  for 
it  to  be  useful  the  sample  space  must  be  appropriately  re- 
lated to  the  instances  in  which  the  algoritlim  is  to  be 
applied.  Section  6 discusses  the  average-case  behavior  of 
certain  algorithms,  but  otherwise  we  are  concerned  only  with 
worst-case  behavior. 


In  tlio  intorost  of  biovity,  I bo  procodiny  flitoo  {.iru- 
qrapliii  h.ivo  glossoU  ovi’i  somo  important  pracl  ioal  and 
plii  1 oRoph  ioa  1 qnost  ions  ooncoining  t (lo  oncodiiuj  of  data,  tbo 
rt' 1 a t i onsb  i p botwoon  opt  imi  7.at  ion  prolilmns  and  docision 
probloms,  ttio  olioico  of  a modol  for  oomput  at  ion , tlie  reasons 
for  enipha.s  i i no  f\.'lynomial  boundi'dness  , and  the  pitfalls 
assooiated  will)  avei  ai.|e-ease  behavior.  These  matters  ari' 
treateii  by  I'ook  and  Keekhow  (1^173),  .Mio  et  al.  (197-1), 

I.awler  (197i>),  Weide  (1977)  and  C.ari'v  and  Jotinson  (19797). 

It  sliould  also  1h'  stressed  that  if  an  algorithm  works  wi'll  in 
praotice,  frequently  handling  large  instances  with  amac.ing 
ease,  then  it  i.s  of  course  a good  and  useful  algorithm  even 
though  it  m.iy  not  be  "good"  in  the  technical  sense  used  hert'. 
That  is  true,  for  e.xample,  of  the  simple.x  algorithm  of  line.ir 
programmi ng.  Its  woist-case  behavior  is  not  polynomially 
bt'unded  (Klee  and  Minty,  1977;  deroslow,  1973;  Avis  and 
Chv.ital,  197s),  even  for  minimum  cost  flow  problems  (".adeh, 
197.3),  and  i t .s  .iverage-c.ise  behavior  has  not  been  pi  oved  to 
be  pol>nomia  1 ly  boundcvi  in  any  sense  th.it  is  convincingly 
related  to  computa  t i on.i  1 practice.  Ni'vertheK'ss,  there  is 
strong  empiiic.il  evidence  lh.it  the  .iver.ige-case  behavior  is 
polynomial  ly  bminded  (Ikintcig,  19i'.3,  p.  IsO).  On  the  other 
hand,  speci.il  network  algorithms  are  .ipt  to  be  faster  th.in 
the  simplex  aliioi  ithm  for  problems  to  which  they  too  can  be 
applied  (hradley,  1975,  p.  77*3). 
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3.  WHAT  IS  THE  STATE  OF  THE  ART? 

Problems  (1)  - (5)  arc  admittedly  special  in  nature,  but 
combinatorial  optimization  is  concerned  witli  special  prob- 
lems. Thus  it  may  bo  reasonable  to  judge  the  state  of  the 
art  by  what  is  known  about  (1)  - (5).  These  proiilcms  appe-ar 
in  various  ways  in  the  next  four  sections,  in  which  the  state 
of  the  art  is  represented  by  three  each  of  firimary  land- 
marks, secondary  landmarks,  exciting  recent  advances,  and 
directions  for  future  lescarch.  if  permitted  to  make  a 

i 

sununary  jiuigment  in  architectural  terms,  we  might  say  that  J 

the  subject  of  combinatorial  optimization  has  gradually  bc'cn 

transformcid  from  the  rococo  to  the  mc-rely  baroque  (tlius 

reversing  the  evolution  that  occurred  in  architecture),  but 

is  still  far  from  the  simple  elegance  of  the  classic  orders 

or  the  sleek  functionalism  of  modern  architecture.  This 

refers  to  a definition  of  rococo  as  "a  me,ininglcss  assemblage 

of  scrolls  and  crimped  conventional  shellwork,  wrought  into 

all  sorts  of  irregular  and  indescribable  forms,"  and  of 

baroque  as  "odd,  grotesque,  bizarre,  having  unusual  form.i- 

tion."  The  sciolls  and  crimped  conventional  sliellwork  arc 

the  many  papers  in  the  subject  which  repeat,  in  only  slightly 

different  form,  algorithms  that  appear  in  earlier  papers. 

As  the  subject  has  developed,  these  "scrolls"  have  fallen 
away  (been  forgotten)  and  interrelationships  have  been  dis- 
covered among  much  of  what  remains.  Tire  subject  has  even 
achieved  a considerable  degree  of  architectural  unity  and  may 
now  properly  be  regarded  as  a branch  of  mathenuitics , or  of 


I 


Tl 

f 
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opi'i'.U  ions  losoarch , oi'  of  coripiitor  soiencf.  llowovor,  the 
is  ot'ftainly  of  "uimsual  formation,"  liooauso  in  t ho 
lot\i|  st'aioli  for  "»iooil"  algorithms  for  (1)  - (5),  none  has 
hoon  provi'il  "bt?st  " rolativo  to  tho  HAM  moiiol  of  computation 


4.  TlIKtOH  I’KIMAUY  hANDMAKKS 


Tlnoo  1 il^^'s  ot  investigation  stanci  out  for  thoir  intrin- 
sic liopt  li  aiul  thoir  infliunico  on  tlio  rest  I'f  t lu'  field.  They 
are  associated  with  prol)lom  (3),  wi  tli  pr<.)blom  (4),  and  loss 
directly  with  (5).  Wi'  consider  only  t tie  iliroctcd  case  of  (3) 
because  its  foimulation  is  slightly  simplt'r. 


A.  Tl>o  Maximum  I'low  Pi  obi  cm 
aiul  Its  Ham i t icat  ions 


Kor  o.ich  real  function  4'  on  A anil  oacli  node  k i N, 

let 


^ (i,k).A 


4'  ( i . k)  . 


The  function  4'  is  an  (s,t)-fliw  if  it  satisfies  the  con- 
servat  ion  I’ondi  t ions 


(a) 


li|^(4')  «-  0 


for  all  k 


N - {s,t } 


(at  any  intormodiato  node,  the  outflow  cquala  the  inflow), 
from  whic)>  it  follows  that  ■“  (the  net  outflow 

at  the  source  s equals  the  net  inflow  at  the  sink  t) . 

The  nun\ber  is  called  the  s-v;il_ue  of  ij* . The  flow  if 

is  g- feasible  if  it  satisfies  the  capacity  constraitits 

(b)  0 ii  i{'(i,j)  a(i,j)  for  all  (i,j)  < A. 

I’roblt'in  (3)  asks  for  an  g-feasiblt'  flow  of  maximum  s-value. 
This  is  a lim'ar  proqramminq  problem  in  the  variables  t(i»3) 
with  objective  function  The  feasible  reqion  is  the  con- 
vex polytope  defined  by  the  equalities  (a)  and  the  in- 

equalities (b)  . Thus  (J)  c’an  be  solved  by  general  lineaj' 
proqjamminq  alqorithms,  but  it  is  also  amenable  to  the 
special  methods,  of  combiiuUorial  optimization. 

An  (s , t ) -cut  is  a set  of  arcs  wliose  removal  from  G 
leaves  no  directed  path  from  s to  t.  The  initial  question 
about  rail  capacities  led  to  the  max-1  low  min-cut  theorem  of 
Kord  and  Fulkerson  (I'lho)  (d iscoveri'il  also  by  Klias  et  al  . , 
1956)  and  to  their  book  on  fMow^  i^l  Network^;  (I’oid  and 
Fulkerson,  196.3),  whicli  stimulated  much  research  on  combina- 
torial optimization.  I'he  tlieorem  asserts  that  t lie  maximum  of 
the  s-values  of  the  g-£easible  (s,t)-flows  is  equal  to  the 
minimum  of  the  weights  of  the  (s,t)-cuts.  The  f 1 ow-auqment- 
inq  paths  used  to  prove  tliis  theorem  ai;^e  used  als.o  in  an 
algorithm  for  finding  a maximum  flow.  As  desci  il'ed  by  Ford 
and  Fulkerson  (1<157,  l'>62),  the  algorithm  is  not  p<.i  I ynomi  a 1 1 y 
bounded  but  it  has  been  applied  successfully  to  a large  num- 
ber of  practic.il  probleni.s.  F.dmonds  aiul  Karp  (I**?.’)  use  the 


method  to  produce  an  O(na^)  algorithm  (see  pp.  116-120  of 
Lawler  (1976)  for  an  exposition  of  their  work) , and 

Kinariwala  and  Rao  (1977)  have  a different  O(n^)  algorithm. 
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The  0(n  a)  maximum  flow  algorithm  of  Dime  (1970)  is  im- 
proved by  Karzanov  (1974)  to  O(n^).  Section  6 contains 
additional  comments  on  the  Dinic-Karzanov  algorithm. 

Space  permits  us  to  mention  only  a few  of  the  many  con- 
sequences and  ramifications  of  network  flow  theory.  An 
early  observation  was  that  if  the  capacity  function  a is 
integer-valued  then  all  vertices  of  have  integer  co- 
ordinates and  hence  any  linear  integer  program  over  can 

be  solved  as  an  ordinary  linear  program,  without  worrying 
about  the  integrality  constraints.  Tliat  led  to  the  search 
for  other  integer  programs  which  could  be  solved  as  linear 
programs  and  thus  to  the  study  of  totally  unimodular  matrices. 
See  Garfinkel  and  Nemhauser  (1972),  Hu  (1969),  Lawler  (1976) 
and  their  references.  In  a different  but  related  direction, 
the  tlieory  of  blocking  and  antiblocking  polyhedra  (Fulkerson, 
1971;  Chvatal,  1976,  pp.  316-318)  may  be  regarded  as  a far- 
reaching  generalization  of  the  max-flow  min-cut  tlieorom. 

Suppose,  in  (3),  that  each  arc  (i,j)  t A has  not  only  a 
capacity  a(i,j)  but  also  a cost  Y(i».i)  of  sending  a unit 
of  flow  along  the  arc,  and  suppose  there  exists  an 
a-feasible  flow  of  given  s-value  v.  Ttic  minimum-cost  flow 
problem  asks  for  such  a flow  iC  whose  cost 

^li  -w  A Y(i»j)4'(i>j)  is  minimum.  The  primal -dual  approach 
to  finding  minimum  cost  flows  led  to  the  general  primal-dual 
linear  programming  algorithm  of  Dantzig  et  al.  (1956). 
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Many  other  problems  of  combi nator ial  optimization  can  be 
reduced  to  the  maximum  flow  problem  or  the  minimum-cost  flow 
problem.  Sec  the  next  paragrapJi  for  one  example  and  Lawler 
(1976)  for  others.  Plainly  if  a single  publication  is  to  be 
named  ns  the  most  important  landmark  of  combinatorial  optimi- 
zation, it  must  be  the  book  of  Ford  and  Fulkerson  (1962).  It 
is  now  outdated  in  many  respects,  but  that  merely  indicates 
its  success  in  stimulating  tlie  development  of  a new  branch 
of  nuit hemat ics . 


B.  Maximum  Matching 

Because  of  its  close  relationship  to  other  combinatorial 
topics,  special  attention  lias  been  paid  to  the  case  of  the 
maximum  matching  problem  (4)  in  which  each  arc  is  of  weight 
1 (cardinal i ty  weighting)  and  G is  bipartite.  Let  P and 
Q denote  the  two  parts  of  the  node  set,  and  form  a directed 
graph  D by  adding  new  nodes  s and  t,  now  arcs  (s,p) 
and  (q,t)  for  all  p e P and  q e 0,  and  directing  all 
old  arcs  from  P to  Q.  Let  each  arc  have  capacity  1.  Then 
the  maximum  matching  problem  for  G is  easily  seen  to  bo 
equivalent  to  the  maximum  flow  problem  for  0.  Hence  the 
maximum  cardinality  matching  problem  for  bipartite  graphs  can 
be  solved  as  a maximum  flow  problem,  but  it  can  be  solved  in 
other  ways  as  well. 

A theorem  of  Borgo  (1957)  asserts  that  for  cardinality 
weighting,  a matching  M is  maximum  if  and  only  if  it  admits 
no  augmenting  path . This  is  a simple  path  S whose  end  node 


arc  not  covoreci  by  M niul  wboso  arcs  belong  alternately  to 
A~M  and  to  M.  If  ScA  is  such  a path  and  M'  is  the 
symmetric  difference  HiJ>S  - (M~S)  n (S'M)  then  M*  is  a 
matching  with  |m'|  »=  |m1  + 1.  Suppose,  conversely,  ttiat 
there  exists  a matching  M'  with  lM'|  > |n1.  Following 
Norman  ai\d  Kahin  (19[>9),  consider  tiie  grapli  G'  = (N,A') 
whore  A'  “ M®M' , Each  node  of  G'  is  of  valence  s 2 , so 
G*  is  a node-disjoint  union  of  isolated  nodes,  circuits 
whose  arcs  alternate  between  M and  M',  and  alternating 
patlis.  Since  |M'|  > M»  at  least  one  of  tlie  alternating 
paths  is  an  augmenting  path  for  M. 

For  the  cardinality  weighted  bipartite  case,  augmenting 
paths  are  relatively  easy  to  find;  cin  0(na)  maximum  match- 
ing algorithm  that  is  unusually  easy  to  understand,  analyze, 
atid  program  can  be  obtained  by  specializing  the  method  of 
besler  and  Ilakimi  (1969).  By  finding  several  augmenting 

paths  simultaneously , Ilopcroft  and  Karp  (1973)  produce  an 

V; 

elegant  0(n  a)  algorithm.  Wlien  G is  bipartite  but  the 

arc-weigt»t ing  a is  unrestricted,  (4)  is  called  the  assign- 

ment  problem.  Tlte  best  available  algoritlims  are  of 
2 

complexity  0(pq  ),  where  p and  q are  tlio  cardinalities 
of  the  two  parts  of  the  node-set  (Kuhn,  1955;  Tomizawa,  1972; 
bawler,  1976,  pp.  201-207). 

Ttie  wor);  of  Edmoi\ds  and  others  on  nonbiparti  te  matcliing 
constitutes  a second  primary  landmark  in  the  development  of 


combinatorial  optimization.  Augmenting  paths  play  a role 
here  too,  but  it  is  much  more  difficult  to  find  them  effi- 
ciently Ilian  in  the  bipartite  case.  For  several  years  there 
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was  doubt  about  the  existence  of  a good  nonbi partite  maximum 
matching  algorithm,  even  for  cardinality  weighting.  However, 

4 

Edmonds  (1965a)  found  an  0(n  ) algorithm  and  then  (1965b) 
showed  how  to  extend  it  to  arbitrary  weightings.  His  imple- 
mentation is  improved  in  the  O(.n^)  algorithms  of  Gabow 
(1973,  1976)  and  Lawler  (1976,  pp.  217-263).  See  also  Feicht, 
Hec)c  and  Pape  (1977).  For  cardinality  weighting  only,  a 
fairly  simple  0(na)  implementation  is  provided  by  Kameda 
and  Munro  (1974)  , and  the  ideas  of  Hoperoft  and  Karp  (1973) 
are  extended  by  Even  and  Kariv  (1975)  to  yield  an  0(n  ) 

maximum  cardinality  matching  algoritlun  for  general  graphs. 

It  seems  safe  to  say  the  latter  has  not  been  widely  checlced, 
for  the  joint  paper  is  quite  condensed  and  the  detailed  expo- 
sition, in  the  dissertation  of  Kariv  (1976) , requires  more 
than  200  pages! 

The  nonbipartite  maximum  matching  problem  is  certainly 
one  of  the  deepest  for  which  good  algorithms  are  )cnown,  and 
the  rationale  behind  Edmonds's  approach  has  been  influential 
in  other  ways  that  space  does  not  permit  us  to  describe 
(Edmonds,  1970;  Chvatal,  1973).  A boo)<  on  matching  is  being 
written  by  Edmonds  and  Pullcyblan)c  (1978?)  . 


C.  NP-Completeness 

A third  primary  landmarlc  of  combinatorial  optimization 
was  the  discovery  by  Coo)c  (1970)  and  Karp  (1972,  1975a)  of  a 
large  class  NPC  of  combinatorial  decision  problems  such  that 
(i)  NPC  includes  many  important  and  difficult  problems  ' 


of  {'ond>irK'itorial  optimization  (for  oxample,  a close  relative 
of  (5)  )--prol)lems  for  which  good  algorithms  have  vainly  been 
sought  for  years  by  many  researchers; 

(ii)  many  problems  in  NPC  are  at  least  superficially  sim- 
ilar to  problems  for  wliich  good  algorithms  are  known; 

(iii)  if  even  one  member  of  NPC  admits  a good  algorithm 
then  they  all  do. 

The  precise  definition  of  NPC  depends  on  the  notions  of  a 
de  t e rm  i n i s t i c Turing  macji_^i\e  (DTM)  , a nondett'rminist  ic  Tur  i ng 
machine  (NTM)  , the  coinpl  exi  ty  of  a TM,  and  other  notions  from 
the  theory  of  computation.  The  details  are  complicated,  but 
some  may  be  omitted  because  it  is  known  that  a problem  admits 
a good  algorithm  for  the  R.'tM  model  of  computation  if  and  only 
if  it  admits  one  for  the  DTM  model.  Thus  for  present  pur- 
ix>ses  a l>TM  may  be  regarded  as  an  ordinary  computer  program, 
and  an  NTM  may  bc'  regarded  as  an  animate  OTM  so  fecund  and 
imaginative  that  it  can,  at  the  start  of  a computation,  make 
a finite  number  k of  guesses  and  immedi.itely  split  into  k 
replicas  of  itself  (to  be  run  simultaneously),  one  to  inves- 
tigate the  consequences  of  each  guess.  The  complexity  of  a 
TM  is  a function  that  tells  how  long  it  t.akes  to  process  sets 
of  input  data  of  various  sizes. 

Let  r (re.sp.  NP  denote  the  class  of  all  decision  problems 
solvable  by  bTM's  ^resp.  NTM's,^  of  polynomial  ly  bounded  com- 
plexity. Then  P consists  of  all  decision  problems  that  admit 
good  algorithms.  And  NP  consists  of  all  decision  problems 
for  wliich  an  affirmative  answer  can  be  obtained  (when  correct) 
by  applying  a polynomial  ly  bounded  "checking  algorithm"  to  tlie 
appropriate  guess  output  by  a finite  "guessing  algorithm." 
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(For  oxainplt',  (xVyVr:)AKAy  ' sat  i ssf  iablo  but 
xA(vVz)A  (xV?.)A  (xVyVz)  is  not.)  It  is  unknown  wln-thcr  there 
is  u good  algoritlim  for  deciding  tlie  sat  i sf  i.il)i  1 i t y of  an 
arbitrary  expressicin  of  lengtli  n.  Cook  (1970)  sliowed  t lie 
satisfiability  problem  to  be  NP-complete  and  then  Karp  (197?, 
1975.1)  establislied  the  NP-complet  eness  of  many  otlier  problems 
Y by  sliowing  tliat  Y < NP  (usually  easy)  and  ttiat  tile  satis- 
fiability problem,  or  some  c'ther  problem  known  to  be  NP- 
compli'ti',  is  reducible  to  Y.  Garey  aiul  Johnson  (1978?)  have 
not  only  the  most  extensive  list  of  NP-eonipIete  problems,  but 
also  an  excellent  discussion  of  various  strategies  for  proving 
NP-completeness . 

In  addition  to  the  decision  problem  most  closely  ri'lated 
to  (5),  the  following  is  also  NP-complet  t' . 

(S')  Determine  whether  G admits  a tour. 

It  is  easy  to  reduce  (5')  to  the  special  case  of  (5)  in 
which  all  arc-lengths  are  0 or  1.  For  let 
A'  {( 1 ,?),(?,  3)  ,...,  (ti-1 , ii)  , (ii , 1 ) ) and  consider  the  aug- 
mented graph  G'  - (N,Ai.iA').  hot  each  aie  in 

A ^resp.  A'~a)  be  of  length  0 ^resp.  l)  . Then  G admits  a 
tour  if  and  only  if  G'  admits  a tour  of  length  0. 

Note  that  if  even  one  NP-hard  problem  admits  a good  .algo- 
rithm, then  P = NP.  In  particular,  P = NP  if  and  only  if 
(S')  admits  a good  algotit)\m.  Tlacse  facts,  (i)  above,  and  the 
fact  that  NTM's  seem  intuitively  to  be  mueli  more  powerful 
than  D'l'M's,  are  reg.irded  by  many  researchers  as  almost  conclu- 
sive evidet»ce  that  P » NP.  However,  it.  is  also  conceivable 


that  (5)  or  (5')  admits  a good  algoritluii  but.  does  not  admit 
one  of  comple.xity  less  tti.an  0(n^’)  where  p is  very  large 
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and  the  algorithm  therefore  so  complicated  that  it  is  un- 
likely ever  to  be  found.  Good  algorithms  have  been  useful  in 
practice  because  for  problems  of  practical  interest  they  have 
seldom  been  worse  than  O(n^)  and  the  multipliers  c have 
not  been  excessively  large.  Does  this  say  more  about  the 
complexity  of  the  interesting  problems  in  P or  about  the 
limitations  of  human  ingenuity  in  devising  algorithms? 

We  mention  only  a few  more  NP-complete  combinatorial 
problems,  mainly  to  illustrate  the  aspect  (ii)  stated  above 
and  to  prepare  for  later  comments.  In  precise  treatments, 
the  term  "NP-complete"  is  usually  reserved  for  decision 
problems  such  as  (S')  and  not  applied  to  optimization  prob- 
lems such  as  those  stated  below.  That  distinction  is 
ignored  here,  for  our  aim  is  only  to  provide  a rough  under- 
standing of  NP-completencss . The  problems  listed  below  all 
appear  (in  slightly  different  forms)  in  the  list  of  Garey  and 
Johnson  (1978?),  and  all  but  (2')  are  given  by  Karp  (1972, 
1975a) . 

The  following  problem  should  be  compared  with  (1) . It  is 
NP-complete  even  in  the  cardinality  weighted  version. 

(1')  For  two  given  nodes  s and  t of  G,  find  a long- 
est simple  path  from  s to  t. 

The  following  should  be  compared  with  (2) . For  each 
fixed  d s 4 , it  is  NP-complete  even  in  the  version  in 
which  all  arc  weights  are  0 or  1.  The  problem  arose  in 
the  design  of  telephone  networks. 

(2')  Given  that  G admits  a spanning  tree  in  which  no 


simple  path  has  more  than  d arcs,  find  such  a spanning  tree 
of  minimum  weight. 


J 


For  the  cardinality  weighting,  (4)  is  equivalent  to 
asking  for  a smallest  set  of  arcs  that  covers  all  nonisolated 
nodes  of  G.  Though  (4)  belongs  to  P,  the  following  prob- 
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lem  is  NP-complete. 

(4')  Find  a smallest  set  of  nodes  that  intersects  all  arcs 
of  G. 

A set  X c N is  independent  if  no  two  of  its  members  are 
adjacent  (joined  by  an  edge) . Since  X is  independent  if 
and  only  if  N~X  intersects  all  arcs,  the  following  problem 
is  equivalent  to  (4'). 

(4'')  Find  a largest  independent  set  of  nodes  in  G. 

The  assignment  problem,  mentioned  earlier,  can  be  stated 
as  follows  in  terms  of  a nonnegative  nxn  matrix 
Among  all  sets  S of  n pairs  (i,j)  that  include  at  most 
one  pair  from  each  row  and  coliamn,  find  an  S for  which 
Z..  c a • • is  maximum.  This  problem  admits  an  O(n^) 
algorithm,  but  the  3-dimensional  assignment  problem  is 


NP-complete.  It  asks,  for  a nonnegative  nxnxn  array 

(ci . .,  ) : 
igk 

(4''')  Among  all  sets  S of  n triples  (i,j,k)  such 

that  no  two  members  of  S agree  in  any  coordinate,  find  one 

for  which  ^ is  maximum. 

(1, ] ,k)  £ S i]k 

' Suppose,  in  fact,  that  each  a. ..  is  0 or  1 and  the 
1 13X 

problem  is  merely  to  determine  whether  the  maximum  in  (4'*') 

' is  equal  to  n.  Even  that  restricted  form  of  the  problem 
is  NP-complete. 
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5.  TllKi;^;  Sl'OONl'AKY  LANDMARKS 

Anioiu)  tlu>  Nr-liartl  ['lolilems,  tlio  tr.wi'liiui  s.jIosiuah  j'l'oh- 
l»'m  (f))  anil  its  ro  1 .it  i vt's  st.itKl  init  for  t ho  onuniiit  ot  .itloii- 
tiou  thoy  h.ivo  loooivovt  .iiul  t ho  vaiioty  of  lu.iotic.il 
a i t iiat  ions  in  wliioh  thoy  h.ivo  .iiiaon.  Tho  aamo  is  1 1 no  of 
tho  shoitost -('.It  h piol'lor.i  (1)  .imoiu;)  thoso  for  which  iiood 
.1 1 iKT  i t hms  .III'  known.  Tht-  col  1 1'l' t i ons  of  nlnorithms  lio.il  incj 
with  thosi'  piobloms  fotm  two  soconil.iry  l.ituim.irka  of  coiahin.i- 
tori.il  opt  inii  .:.il  ion . .A  t li  i nl  is  tho  stuJy  of  o-pt  imiz.it  ion  in 
imii'poiulonoo  systi'ms,  wliioh  oiicjin.iti'd  from  tho  minimum 
ap.inniiui  tioo  prol'lom  (»).  A fourth,  not  disousst'd  hori',  is 
tho  p.ii't  of  polyhodr.ll  comhin.itor  ics  th.it  studios  intor- 
rol.1t  ionsliips  .imonu  oombin.itor  i.il  opt  imi  z.it  ion  , lino.ir 
prour.immi  iKi  du.ility,  .ind  tho  f.ici.il  structuro  of  oonvo.x 
jx^lyhodra  (K.dmonds,  l'U.5,  I'-iTO;  Kulkorson,  1^71, 

Lov.iaz,  1S7.',  l‘)77;  I’hv.it  .1 1 , 197.1,  1975;  H.U  inski  .ind 
hotfm.in,  19757). 

A.  Tho  Tr.ivol  i 111)  S.ilesm.in 

Somo  import.int  p.ipors  from  tho  o.irly  liistory  of  tho 
tr.ivi'lini)  s.iloam.in  problom  .nro  thoso  of  K.istm.in  (1958), 
r'.inlziij  ot  .il.  (1959),  Littlo  ot  .il.  (19t>J),  Lin  (1955), 
tlomi'ry  (I'lob)  .ind  Sh.ipiro  (1955).  A comprotionsivo  survoy  of 
work  up  to  1958  is  prov'idod  by  hollmoro  .ind  Nomhau.sor  (19oS). 
Tho  presont  situ.ition  is  aumm.irizod  bolow. 


Though  C.ilmoio  aiivi  Gomoty  (19t>-I),  Gyr.to  (1973)  v^iui 
Gaif  iiiki'l  (1977)  )uivi.'  foutul  a Uior  1 1 )\ms  tor  r.omo  UMOtiil 

spooial  casfs  of  (S),  ilovo  1 oonont s of  Dili;  sot  t aro  limitt'd 
by  till'  fact  tliat  (f>)  is  N)’-(iaid  oven  foi  flu-  special  case  in 
wliic)i  G is  inuliiecti'd  and  (a)  eacli  arc- len<:jt  )i  is  0 or  1, 
or  (b)  G is  complete  and  its  node-set  lies  in  the  I'.uclidean 
plane,  Die  leiuitti  of  eac)i  arc  be  i na  t tu‘  distance  between  its 
nodes.  Tlie  second  result  is  diu'  to  Gan'y,  Gialiam  .ind 
Jolmson  (l‘)7i,)  and  I’apad  imi  t r iou  vl977).  Tlie  first  follows 
from  Nb-cv>mplett'ness  of  tbe  undirected  veision  of  (S'), 
wliicti  liolds  even  for  tlie  special  case  in  wtiicti  G is  a 
3-valent  3-connectei.l  planar  cirapli  (Garey,  dolinson  and 
Tar  j an , 1 9 7i> ) . 

Wlien  a problem  calls  for  minimiration  of  a nonticaat  ive 
function  f,  an  alqoritlim  for  tlie  problem  is  t'-a_pproximate 
if  eacli  "solution"  output  by  t be  algorithm  lias  f-value  .it 
most  1 -» r times  Die  optimum  value.  Now  suppose  that  r * Nl' 
and  . is  (for  some  :)  an  f-approximate  .ilaorithm  for 

the  undirected  case  of  (S).  Then  Sahni  and  Gonrale:*  (197i') 
show  A^_  is  not  polynomially  bounded,  while  Pap.idimi  t r iou 
and  Sti'iijlitr  (1077)  rdiow  that  if  A is  a local  sixirch 

t 

algorithm  of  the  sort  described  below,  even  the  search  phase 
is  not  polynomially  'oounded. 

When  G is  undirected  the  traveling  salesn.an  pioblem 
may  be  approached  by  the  local  search  methods  of  bin  (lOtifi), 
Reiter  and  Sherman  (lOuS),  Karg  and  Thompson  (1064)  as  im- 
proved by  Raymond  (lOtiO),  many  others.  See  Bellmore  and 

Nemliauser  (1068),  Garfinl^el  and  Nemhauser  (I077),  Savage  et 
al . (1976),  and  rap.idimitriou  and  Steiglitr  (1977)  for  a few 


r 
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ru'fo  ro  f t-Ti’iiCi's  . In  ttu'iJO  1 gor  i t hjii.-; , one  I'tnils  with  an 
arbitrary  tour  and  tlu'n,  by  soaiohing  a sot  of  tours  wliicli 
aro  oonsidorod  its  lu'iqlibors,  oitlu-r  finds  a slioitor  tour  and 
usos  it  as  a now  starting  point  or,  if  no  shortor  four  is 
found  in  t!us  way,  torminatos.  (for  osaraplo,  tiio  noighbors 
of  a tour  T might  bo  all  tours  T'  obtainablo  from  T by 
roplaoitiv)  tini'o  suooosrivo  aios  iw,\),  {x,y)  ,ind  iy,  = } of 

T by  t ho  aros  {w,y},  {y,x'  and  {x,;:}.)  Tlio  rosulting 

tour  is  looally  optimal  rolativo  to  tho  no  i ghlH'i  hood  systoiu 
undorlying  tho  soaroh  proooduro.  Sinoi'  tho  looally  optimal 
solrtions  ofton  turn  out  to  bo  globally  optimal  or  olose  to 
that,  and  sinoo  tho  computation  time  is  ofton  modest,  those 
methods  aro  frequently  used  in  attacking  largo  instances  of 
(5).  Intrinsic  limitations  are  indicati-d  by  tlu'  theorem  of 
I'apad  imi  t r iou  and  Stoiglit:^  (1^177)  and  by  tho  fact  that  oven 
if  r - Nl'  no  exact  (0-approx  imat  o ) local  soar^-h  method  can 
bo  polynomial  ly  bounded  (Savage  ot  al.,  ll'7t').  See  Coldon 
(1977)  for  statistical  analysis  of  another  heuristic  ap- 
proach to  (S). 

Christofidos  (197t\)  has  an  0(n'^i  -'-approximate  algorithm 
for  tho  special  case  of  (h)  in  ichich  G is  a complete  un- 
directed graph  whoso  arc-longth.s  satisfy  tho  triangle 
inequality  (a(i,j)  - a(i,k)  + a(k,j)).  It  first  finds  a 

minimum  spanning  tree  and  then  solves  a matching  problem. 

His  analysis  is  sharpened  by  Cornuojols  and  Nomhausor  (197S). 

When  G is  undirected  and  one  seeks  a tour  that  is 
definitely  optimal,  tho  appioach  of  Held  and  K.irp  (1'170, 

1971)  is  called  for.  It  combines  a bi  aiu'(i-and-bound  proce- 
dure (for  sui coys  of  such  procedures,  see  Lawler  and  Wood, 


19h(i;  Hittc'ii,  1970;  C.i'of  fr  i on  citul  M.irr.tc'ii,  1972; 

anil  Clarfinki'l,  1971!)  with  an  ar.cont  mi'thoil  that  involvos 
wlu'to  t hi'.si'  ronsisL  of  ,i  troo  with  noiU;-r.ct 
{2,...,t>)  lOMolhi'i'  with  two  arcs  incident  to  node  1.  oince 
tourr.  ai'c  simply  I-ttccs  in  wli  i cli  cacli  node  is  of  valence  2, 
ii  minimum  1 -I ree  that  is  a tour  is  in  fact  a minimum  tour. 
Thi'  viseenl  mottiod  is  based  on  t l\e  faid  that  n\inimum  1-tteer. 
are  eas.y  to  find,  and  ttiat  certain  t i ansfoiniat  i on::  of  aic- 
leiKith  pre.serve  minimum  tours  but  may  produce  new  minimum 
l-trees.  The  approach  of  Held  anil  Karp  ha:;  beei:  refined  by 
lli'lil  et  al.  (197-1),  llelbiq  llaiuaui  and  Krarup  ()97'1),  and 
•SmitJi  and  Tliomp.son  (J  977). 

for  the  t ravel ituj  salesman  problem  (S),  mil  ike  the  short- 
est patli  prolilem  (1),  transition  from  the  "symmetric" 

(unili  reet  ed)  ca:;c  to  the  tieneral  "asymmetric"  (directed)  case 
ro:;ults  in  a siiinificant  reduction  in  t hi’  size  of  instances 
that  can  be  :;olved  in  a reasonable  amount  I'l  tirai'.  for  tlu' 
general  ca:;e,  the  be::!  exact  mcthi’d  is  apparently  a mollifica- 
tion, due  to  Smith  I't  al.  (1977),  of  earlier  methods  of 
f'Slman  (1951!),  Shapiro  ( 1 977),  and  Hellmore  and  Maloni- 
(1968).  Tlu'y  all  ri’ly  on  the  fact  that  there  is  a good 
algorithm  for  miniiiiizing  the  sum  a(n)  = a ( i , ii  ( i ) ) 

over  all  'pet  mut  at  i on::  ti:N-*N  (this  is  a form  of  the  assign- 
ment ’problem),  while  (5)  requires  m i ti  i m i za  t i on  of  a(:i)  over 
the  cyclic  permutations. 

Some  approximate  algorithms  for  (5)  that  have  iiood 
averago-t i me  behavior  are  mentioned  in  Section  0. 


U.  Shoftf'sl  r.it  lu? 

;;iu't  I i';.l  I'.it  h ,1  U)0'  1 l tin\f!  woio  r-iu  voyivi  by  I'tt'yfiii! 
and  il  i 1 1\  I'y  l'v>n\?;i'hKi'  (1'’7.')  niui  l.nwliM  (l'>7(i) 

In  vu'Vv'  ol  Ih.vii'  ntvulic'ri,  t h.'  inn  vi'ys  by  brain  I'y  (I'W'')  anil 
I.awli'i  (l''7('M,  aiiil  ( li<-  1' i h 1 i oa  i apli  i (’a  ro  1 1 c'lM  ni  by 

.iiul  t’aj'i'  (l'>77),  wi'  niaki'  t'n  1 y a fi'Vi'  i omarkp  hrri' 
lathiM  than  at  t rmpt  i nv)  to  rnniiinai  i 70  a taivic'  boity  of  niatoiial. 

t’ri'l'lom  (1)  in  visually  solvoil  ars  pai  t of 
(1*1  l\n  a aivon  noiU'  s of  vl , f i ml  short  osl  paths  from 
s to  all  ot  lu'i  noili's  aoooss  1 bl  r*  from  s. 

Ono  mvaht  think  (1)  oouUl  bo  solvoil  s i an  i ( i oant  1 y faslv'i  than 
()*).  I'll, It  is  I I IK'  for  sons'  i nt  t' j ost  i iva  spi'oial  olassv's  of 
at  aphs  (Maillook,  1"77,  l''7(!),  but  piobably  not  foi 

ai'noial  araphs. 

i'.i'voial  aooil  alaorithms  ai<'  availablo  (01  (1*1,  Iho  bi'sl 

kni'wn  b<’ i na  I In-  (t(i/)  ptoi'i'.lino  I'f  Pi  jkslia  (I"'-'".  I'y  a 
liimplo  ns('  I'f  piioiity  a"'''"'''>  '*  alsi'  bo  i m)' 1 v'mont  ovl  in 

tins'  ('(a  1 1'a  nl  (bavi'li'i,  l’'7t'),  an  ailvanlaa*'  tot  raittioiont 
ly  spaisi'  aiaphs.  t'y  a nii'i  o oompl  ioat  I'll  um'  I'f  piii'iity 
auom's,  Johnson  (l‘>771  shi"\'s  that  f I’t  oaoh  f t xoil  posit  i vi’ 
intoaor  k tlu'ii'  is  an  iniplons'iit  at  ion  that  i tins  in  t inio 
i'(min  (n  1 a,  a 1 oa  u)).  Whon  all  aio  Iv'iiaths  aro 

I'ositivv'  inti'aorr.  t tu'  alaiifithm  of  Waanoi  il'>7ttl  solvos  (1*1 
in  tins’  v'(max  (n,a,(l)),  whoro  J vs  t ho  nvaximvinv  I'f  t lu' 
lonaths  vif  shvirtost  paths  f 1 oni  s to  otls'i-  tvoJos. 
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I ! 

Applyjnq  Dijkr.tra'r.  prococlurc’  ri  times  yields  an  0(n  ) 
solution  of 

(1**)  Kor  each  pair  (s,t)  of  nodes  of  G,  find  a short- 
est path  from  s to  t . 

The  clt'vor  but  complicated  approach  of  Kredman  (197G)  sliows 

3 y? 

that  witli  0(ir  loq  loq  n/  loq  n)  proproccssiruj  an  0(n  ) 

solution  of  (1**)  is  available.  The  proproccssinq  consists 

of  compilinq  a table  that  is  tlien  used  for  all  instances  of  a 

given  size  n. 

The  most  (>legant  solution  of  (1**)  is  the  O(n^)  algo- 
ritluu  of  Floyd  (1962).  Fxtend  tl>e  definition  of  arc-length 
by  setting  ct  ( i , j ) = wlien  (i,j)  is  not  an  arc,  and  then 
proceed  as  follows: 
begin 

for  k •<  1 until  n ^ 
ftir  i ■<  1 unj^il  n do 
tor  j 1 unt  i 1 n 

a(i,j)  ••  min  {u(i,j),  a ( i , k ) + a(k,j)) 

end . 

Wlien  this  comi>ulation  terminates,  a(i,i)  is  the  length  of 
a sliortest  circuit  through  node  i,  and  when  i f j a(i,j) 
is  tl»e  length  ol  a shortest  path  from  i to  j.  From  this 
information  it  is  easy  to  find  the  paths  themselves. 

In  contrast  to  the  sliortest-path  algorithms  mentioned 
earlier,  Floyd's  algorithm  does  not  require  nonnegative  arc 
lengths.  It  applies  to  an  arbitrary  directed  network  in 
which  no  circuit  is  of  negative  Ifuigth.  A related  algoritliin 
of  Yuval  (1976),  using  the  fast  multiplication  of  matrices 


duo  to  Stiasscn  (1969)  and  based  on  an  oxtonnion  of  t lie  RAM 
model  tluiL  permits  infinite-precision  real  ar  i tliniet  ic , solves 
(1**)  in  time  0(n^‘^^).  Klee  and  barman  (1978)  extend 
Floyd's  metliod  to  find  shortest  paths  amony  those  satisfying 
various  sorts  of  restrictions.  For  networks  in  wliicli  no 
circuit  is  of  negative  length,  Jolinson  (1977)  and  bawler 
(1978)  show  liow  to  take  advantage  of  sparseness  in  the 
directed  case,  and  I.awler  (1976)  sliows  trow  to  usr'  matcliing 
algoritlinis  to  find  shortest  patlis  in  the  undirected  case. 

See  F.ect.ion  6 for  a solution  of  (1**)  wlrose  average-case 
2 2 

com()lex i ty  is  0(n'  log  n)  , and  see  .Section  7 for  lower 
bounds  on  the  complexity  of  sliortest-path  algorittims. 


C.  Spanning  Trees  and  Op>t  imi '/at  ion 
in  Indoponilence  Systi'ins 

I.et  us  turn  now  to  tlie  minimum  spanning  tree  problem  (2). 
For  a connected  undirected  graph  G = (N,A)  and  nonneg.itive 
arc-length  function  a,  we  sc'ek  a spanniirg  tree  of  minimum 
weight.  Call  a subset  of  A ijidepenclent  if  it  contains  no 
circuit.  And  for  x t .X  >-  A,  say  that  x is  a firs^  memlier 
of  X if  a(x)  s a(x')  for  all  x'  t X.  Kruskal  (1996) 
showed  a minimum  spanning  tree  T can  be  constructed  as 
follows,  wliere  U is  the  set  of  unexplored  arcs. 
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begin 

T 0;  U A; 

while  U * 0 ^ begin 

u ♦-  a first  member  of  U; 

U ■<  U~{u}  ; 

if  Tu{u}  is  independent  then  T •«-  Tu{u} 
end; 

print  T 
end. 

Kruskal's  algorithm  can  be  implemented  in  time 

0(a  log  n) . A later  algorithm  of  Prim  (1957)  and  Dijkstra 

2 

(1959)  solves  (2)  in  time  0(n  ) or,  as  implemented  by 
Kerschenbaum  and  Van  Slyke  (1972),  in  time  0(a  log  n) . For 
sparse  graphs  the  best  algorithms  are  those  of  Yao  (1975) 
and  Cheriton  and  Tarjan  (1976),  which  are  0(a  log  log  n) . 

On  the  other  hand,  for  each  positive  integer  k there  are 
algorithms  of  Johnson  (1975)  and  Cheriton  and  Tarjan  (1976) 
which,  when  applied  to  graphs  having  a s cn^  ^ for  some 

fixed  c > 0,  solve  (2)  in  time  0(a). 

Actually,  Kruskal's  procedure  finds  minimum  spanning 
trees  without  any  restriction  on  the  sign  of  a,  so  the  same 
procedure  can  be  used  to  find  maximum  spanning  trees;  simply 
redefine  first  member  to  moan  of  maximum  a-value.  The  name 
of  greedy  algorithm,  proposed  by  Edmonds  (1971) , is  then 
especially  appropriate,  for  one  attempts  at  each  stage  to 
swallow  (add  to  T)  the  largest  raon±>er  u of  U,  refraining 
only  if  u is  immediately  unpalatable.  No  attention  is  paid 
to  the  possibility  that  this  greedy  choice,  however  tempting 


at  the  moment,  may  in  the  long  run  cause  indigestion!  And, 
in  fact,  tlie  greedy  algorithm  not  only  solves  (2)  but  works 
in  the  more  general  matroid  setting  described  below. 

The  theory  of  matroids,  wliich  can  be  axiomatized  in  many 
equivalent  ways,  was  invented  by  Whitney  (1933)  as  a gener- 
alization of  tlie  theory  of  linear  independence.  Let  us 
define  an  i ndependoncc  system  as  a finite  family  T of 
finite  sets,  called  i ndepondent  sets , such  that  every  subset 
of  nn  independent  set  is  independent.  (In  the  example  above, 
a sot  of  arcs  is  independent  if  it  contains  no  circuit.)  A 
matroid  is  an  independence  system  I sucli  that  for  each 
subset  S of  Ul,  all  independent  subsets  of  S are  of  the 
same  cardinality.  An  equivalent  condition  is  that  whenever 
I and  J are  independent  sets  with  |l|  < |J|,  there 
exists  j £ J such  that  Iu{j}  is  independent.  A base  of 
a matroid  I is  a maximal  member  of  I.  (In  tlie  example, 
the  bases  are  the  spanning  trees  of  G.)  Lawler  (197G)  lists 
several  books  and  other  references  on  matroid  theory,  the 
latest  book  being  that  of  Welsli  (1976).  Bruno  and  Weinberg 
(1976a, b)  present  matroid  tI\eory  as  the  proper  foundation 
for  the  study  of  electrical  networks,  an  important  advantage 
being  tliat  tlie  well-developed  duality  tlicory  of  matroids  can 
replace  tlie  purely  grapli-tlieoretic  duality  notions  which  are 
satisfactory  only  for  planar  graplis. 

If  I is  a matroid,  < is  a linear  ordering  of  Ul, 
and  "first"  is  interpreted  in  terms  of  <,  then  with  A re- 
placed by  Ul  the  above  algorithm  produces  a base  T of  I 
that  is  optimal  in  the  following  strong  sense:  for  any  otlier 
base  T'  of  J there  is  a bijection  41:  T -►  T'  with 


t ' ♦(t)  for  nlJ  t e T.  Tliii;  was  ostciblishod  by  Undo 
(1957),  ii'd  i i-.covfrod  indopondontly  by  Gale  (1968),  Wcls)i 
(1968)  and  Kdnionds  (1971),  and  rxtfndod  to  infinite  mati'oids 
by  Klee  (147)).  In  addition  to  t lie  spanning  tree  problem, 
Lawler  (1976)  describes  a " semima tclii ncj"  problem,  a job- 
seqiiencing  problem,  and  an  experimental  design  problem  that 
have  underlying  matroid  structures  and  hence  can  be  solved 
efficiently  by  the  greedy  algorithm.  Of  course,  the  com- 
plexity of  an  implementation  depends  on  the  difficulty  of 
recognizing  members  of  I. 

Each  independence  system  I can  be  expressed  in  various 
ways  as  the  intersection  of  a finite  number  of  matroids. 

Many  important  problems  of  combinator ial  optimization  can  be 
cast  in  the  following  form:  For  an  independence  system  I 
that  is  the  intersection  of  matroids  of  specified 

sorts,  and  for  a nonnegative  function  a on  Ul^,  find  a 
member  of  I tliat  is  of  maximum  a-weight--t  hat  is,  maximize 
^iel  over  all  I £ 1.  Lawli'r  (1976,  Chap.  8)  observes 

that  the  traveling  salesman  problem  (5)  and  the  d-dimcnsitmal 
assignment  problem  (4''')  can  be  formulated  in  this  way  for 
k = 3,  so  good  exact  algoritlims  should  not  be  expected  for 
general  tliree-matroid  optimization  problems.  However,  foi' 
k = 2 Lawler  (1975,  1976)  has  both  a "primal”  algorithm 
analogous  to  a procedute  for  finding  minimum  cost  network 
flows  and  a "primal-dual"  algoiithm  analogous  to  a procedure 
for  finding  maximum  matchings.  The  directed  analogue  of  t lie 
minimum  spanning  tree  problem  (2)  can  be  handled  in  this  way, 


but  is  solved  more  efficiently  by  the  special  methods  of 
Chu  and  Liu  (1965)  and  L'dmonds  (1967).  See  also  Lawler 
(1976)  and  his  references,  and  Tarjan  (1977) . 

When  1,  )i  and  a are  as  in  the  preceding  paragraph, 
and  "first"  moans  of  maximum  a-value,  the  greedy  procedure 
described  earlier  can  be  used  as  a heuristic  for  finding 
indepcndt'iit  sets  of  largo  weiglit.  Results  of  Jcniiyns  (1976) 
and  Korte  and  Hausmann  (1978)  sliow  that  if  1^  is  a member 
of  I of  maximum  a-weight  and  is  the  output  of  the 

greedy  lieuristic  as  applied  to  I and  a,  then 
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where  ir(S)  ^resp.  ur(S)^  is  the  minimum  ^resp.  maximum^ 
of  the  cardinalities  of  the  independent  subsets  of  S.  Both 
inequalities  are  sharp  in  certain  senses.  Sec  Korte  (1978) 
for  a fuller  description  of  "greedy"  heuris-ti cs , their  uses 
and  limitations,  and  for  additional  references. 


6.  THREE  RECENT  .^DV^NCES 


As  is  clear  from  the  recent  dates  of  many  of  the  cited 
references,  the  field  of  combinatorial  optimization  is  in  a 
very  active  stage  of  development.  Three  of  the  most  ex- 
citing recent  advances  have  been  the  Dinic -Karzanov  maximum 


flow  algorithm,  Minty's  good  algorithm  for  finding  maximum 
independent  sets  in  a large  class  of  graphs,  and  a collec- 
tion of  algorithms  whose  average-case  behavior  is 
remarkably  good. 


A.  The  Dinic-Karzanov 
Maximum  Flow  Algorithm 

The  papers  of  Dinic  (1970)  and  Karzanov  (1974)  wore  men- 
tioned earlier.  hike  other  maximum  flow  algorithms,  the 
Dinic-Karzanov  algorithm  starts  with  a feasible  flow  if, 
uses  if  and  the  underlying  network  to  produce  an  auxiliary 
network,  and  by  a computation  in  the  latter  attempts  to  find 
an  improvement  f ' of  if.  If  an  improvement  f ' is  found, 
the  computation  is  repeated  with  f ' in  the  role  of  <f , 
and  wlien  no  improvement  is  found  then  cf  is  optimal.  How- 
ever, in  the  now  algorithm  the  improvements  are  not  made  by 
flow-augmenting  paths  but  in  a more  global  manner  that  leads 

3 

to  an  0(n  ) algorithm.  The  announcement  of  Karzanov  (1974) 

outlines  the  method  but  docs  not  include  details  of  proof. 

Details  can  be  found  in  tlie  book  of  Adelson-Vclsky  et  al. 

(1975)  (in  Russian)  and  in  the  exposition  of  Even  (1976) . 

Vj  z/j 

The  algorithm  has  been  improved  to  0(n  a ) by  Z . Galil. 


34 


r 


I(.  Minty's  Algorithm 
for  Maximum  Independent  Sets 

The  maximum  independent  set  problem  (4'')  is 
NP-complete,  even  when  restricted  to  3-valent  planar  graphs 
(Garey,  Johnson  ai^d  Stockmeyer,  1970),  but  one  may  liope  for 
good  algoritlims  that  solve  the  problem  for  special  classes  of 
graphs.  I'or  references  to  such  algorithms,  and  to  other  re- 
strictions under  which  the  problem  remains  NP-complete,  see 
Garey  and  Johnson  (1978?).  In  the  bipartite  case,  (4'')  is 
well-solved  by  matching  (Lawler,  1976,  pp.  189-196).  By  far 
the  most  successful  attack  on  the  nonbipart ite  case  is  due  to 
Minty  (1978?),  who  shows  that  matching  techniqut's  are  also 
applicable  to  all  graphs  that  are  claw-free.  A c^aw  is  a 
quadruple  of  nodes  (v,w,x,y)  such  that  each  node  in  tlie  set 
S = {w,x,y}  is  adjacent  to  v but  no  two  members  of  S 
are  adjacent. 

To  appreciate  how  close  Minty's  result  comes  to  tlie 
boundary  of  NP-complet eness , consider  the  restricted  form  of 
the  3-dimensional  assignment  problem  mentioned  at  the  end  of 
Section  4.  Let  G be  the  grapli  whose  nodes  are  ttie  ordered 
triples  (i,j,k)  of  integers  between  1 and  n for  which 
a. ..  = 1,  two  nodes  being  adjacent  if  they  agree  in  at  least 

1 3 K 

one  coordinate.  Then  G may  have  claws  but  it  is  free  of 
quintuples  (v,w,x,y,7.)  sucli  that  eacli  node  in  the  set 
S = {w,x,y,z}  is  adjacent  to  v but  t>o  two  members  of  S 


m 


arc  adjacent.  The  problem  of  deciding,  for  each  such  G, 
whether  G contains  an  independent  set  of  cardinality  n, 
is  NP-complcte . 

Minty  also  considers  the  weighted  version  of  (4''),  in 
which  a node-weighting  a is  given  and  one  seeks  an  inde- 
pendent sot  of  maxinuim  weight.  He  shows  that  for  claw- free 
graphs  tliis  problem  is  reducible  to  the  weighted  matching 
problem  and  hence  solvable  in  polynomial  time. 


C.  Some  Algorithms 
with  Good  Average-Case  Behavior 

From  the  worst-case  view'point,  some  well-solved  problems 
of  combinatorial  optimization  have  been  so  intensively 
studied  that  further  significant  improv'ement  in  solution 
methods  seems  unlikely;  also,  most  researchers  doubt  the 
existence  of  good  algorithms  foi  the  NT-complete  problems. 
However,  neither  of  these  comments  applies  to  tlie  average- 
case  viewpoint.  This  fact,  along  with  the  belief  tliat 
average-case  behavior  is  more  important  than  worst-case  be- 
havior in  many  applications,  has  motivated  the  search  for 
algorithms  of  good  average-case  behavior. 

Spira  (1973)  and  Karp  (1978)  consider  a complete  directed 

network  ((N,A),«)  whose  arc  lengths  a(i,j)  are  all 
drawn  independently  from  the  same  probability  distribution  in 
Spira  proposes  an  algorithm  for  the  all-pairs  short- 
est path  problem  (1**)  and  shows  tliat  for  any  distribution 

2 2 

the  expected  running  time  is  0(n  log  n) . Fredman  (1976) 
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shows  how  to  reduce  the  expected  number  of  comparisons  (but 

2 

not  the  overall  complexity)  to  0(n  log  n) . Carson  and  Law 
(1977)  report  on  computations  comparing  Spira's  method  for 
various  distributions  with  an  improvement  of  the  method  of 
Dij)cstra  (1959)  duo  to  Yen  (1972). 

Karp  (1  978)  proposes  an  O(n^)  appro: i mate  algorithm  for 
the  traveling  salesman  problem  (5).  His  algorithm  first 
solves  the  n\n  assignment  prcblem  for  tlic  matrix  (a(i,j)) 
and  tlien  patclies  together  the  cycles  of  the  resulting  pemiu- 
tation  to  form  a tour.  He  shows  ttiat  if  tlio  a(i,j)  are  drawn 
independently  from  the  uniform  distribution  on  ]0,l[  then, 
with  probability  tending  to  1 as  n the  ratio  of  the 

length  of  the  tour  produced  by  the  algorithm  to  that  of  an 
optimum  tour  is  < 1 + c(n),  where  c (n)  -'•0  as  n •>  •»>. 

A tool  in  his  proof  is  a result  of  Walliup  (1977)  on  tlio 
expected  value  of  a random  as.signmcnt  problem.  Similar 
patcliing  schemes  appear  in  algorithms  proposed  by  Karp  (1977) 
for  the  traveling  salesman  problem  in  the  Kuclidean  plane. 

In  tile  results  of  Spira  and  Karp  stated  above,  the  under- 
lying  graph  is  always  complete;  only  the  arc  lengths  vary  at 
random.  For  other  results  on  the  av’oragi'-case  behavior  of 
algorithms,  a notion  of  a random  undirected  graph  tliat  has 
node-set  N = {l,...n}  and  lias  a specified  density  p (or 
has  a specified  number  E of  arcs)  is  required.  Let  t)^ 
denote  the  set  of  all  n(n-l)/2  unordored  pairs  of  distinct 
integers  between  1 and  n,  each  pair  representing  a 
possible  arc  of  a graph  with  node-set  N.  In  the  "p"  model, 
a sample  graph  is  formed  by  including  each  member  of 
with  probability  p,  independent  of  what  other  arcs  are 
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chosen.  In  the  "K"  model,  the  sample  space  consists  of  all 
A «■  for  which  (a|  = E,  all  such  arc-sets  A being 

drawn  with  equal  p'robabi  1 i t ies  . When  E = pn(n-l)/2,  the 
two  models  are  closely  related  (Erdos  and  Renyi,  1961); 
Angluin  and  Valiant  (1977)  sliow  ttiat  if  in  a certain  sense 
an  algoritlim  works  for  one  model  tlien  it  works  for  the  other- 
one  also. 

A matching  is  perfect  if  it  covers  all  nodes.  Erdos  and 
Rdnyi  (1966)  show  that  if  E(n)  - -^n  log  n + u'(n)n  and  if 
PM(n)  is  the  probability  that  a random  graph  with  n nodes 
and  E(n)  arcs  admits  a perfect  matching,  then  as  (even) 
n -►  PM(n)  -►I  if  ui(n)  •*  •»■.  Using  a model  of  compiuta- 

tion  similar  to  but  more  fle.\ible  ttian  tire  RAM  model,  Angluin 
and  Valiant  (1977)  show  there  e.xists  an  algorithm  that  has 
the  following  properties: 

(a)  accepting  as  inpiut  an  undirected  graplr  G witli  n 
nodes,  it  outputs  a perfect  matcliing  or  concludes  correctly 
that  G does  not  admit  one; 

(b)  there  exist  positive  constants  c and  k such  that 
for  each  n and  for  each  E i cn  log  n,  the  exp>ected  run- 
time is  less  tlian  kn  log  n for  random  grap'hs  with  n 
nodes  and  E arcs. 

Pdsa  (1976)  shows  that  if  E(n)  = 6n  log  n witli  6 > 1, 
and  if  T(n)  is  the  probability  that  a random  graph  witli 
n nodes  and  E(n)  arcs  admits  a tour,  then  T(n)  > 1 as 
n -•  Karp  (1976)  uses  Posa's  argument  as  the  basis  of  a 

polynomial  time  algorithm  for  finding  tours  almost  surely. 
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Consideriny  spanning  simple  p;iths  as  well  as  tours,  Angluin 
and  Valiant  (1977)  describe  an  algorithm  that  has  the 
follov/ing  properties: 

(a)  accepting  as  input  an  undirected  graph  G with  n 
nodes,  and  a pair  of  nodes  s,  t with  1 < s < t < n,  it 
attempts  to  find  a simple  patli  from  s to  t that  uses  all 
nodes  (a  tour  when  s = t) ; 

(b)  for  each  y > 0 there  exist  positive  constants  c and 
k sucli  that  for  each  n,  for  each  E > cn  log  n,  and  for 
each  pair  s,  t,  tlie  probability  is  at  least  1 - 0(n  ) 

(for  random  graphs  with  n nodes  and  E arcs)  that  the 
desired  path  is  found  in  fewer  than  kn  log  n steps. 

Essentially  the  same  results  are  obtained  by  Angluin  and 
Valiant  (1977)  for  the  problem  of  finding  a tour  or  a span- 
ning simple  path  in  a directed  graph.  Their  complexity 
estimates  are  based  on  a model  of  computation  in  which 
randomi.-'ed  decisions  arc  possible,  but  are  worsened  only  by 
an  additional  factor  of  log  n when  that  possibility  is  re- 
moved from  the  model. 

See  Karp  (1976)  for  the  results  of  probabilistic  analyses 
of  other  combinatorial  search  algorithms,  including  use  of 
the  greedy  algorithm  to  find  large  independent  sets  of  nodes. 
And  see  Evans  (1976)  for  a treatment  of  maximum  flow  problems 
in  probabilistic  graphs.  Also,  see  the  last  two  paragraphs 
of  Section  7. 
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7.  TIIRI:'.!-:  1)1  Ui:CT JONS  J-QK  KMSMARCH 

Since  t Jio  fioJii  of  combinnlojidl  optimization  i r. 
dovclopinq  so  rapidly,  it  provides  many  directions  loi'  re- 
search and  many  si'ccific  problems  that  beg  for  soluLic'n. 

Rathi-r  tlian  discussing  individual  problems,  we  focus  on  the 
tliri'i'  directions  of  lesearch  that  seem  respectively  to  be 
most  piomisinq  for 

(a)  elucidation  of  the  fundamental  theoretical  issues; 

(b)  unification  of  the  existing  theoiy; 

(c)  production  of  useful  or  interesting  algoi'ithms. 

A.  Finding  bower  lloutids 

The  go.il  heie  is  simply  stated:  Find  sharp  lower  bounds 
for  the  comf'ut  at  i ona  1 cong'K'xity  of  the  important  pioblem:: 
of  combinator  ial  oid.  i mi  zat  ion  , i>ieferal)ly  with  re:'.pect  to 

the  ItAM  or  sonu'  closely  related  model  of  landom  access  i 

comtnit at  ion.  When  a problem  deals  with  netwoiks  ((N,A),n),  ! 

1 

we  say  the  pioblem  is  of  comi)l»'xity  il()(n,a))  if  theie  is  i 

a constant  c > 0 such  that  I'very  algoi  ithm  for  solving  the 

problem  ro<iuires  at  least  ci  (n,a)  steps  for  worst-case 

input  with  paraiiu't i-rs  n and  a.  This  lower  bound  is  sha»^) 

if  the  problem  can  be  solved  by  an  algorithm  of  complexity 

0(i(n,a)),  so  that  the  worst -case  behavior  of  the  problem  is 

deteimined  to  within  a mvil  t i pi  icat  i ve  constant. 


At  least  with  respect  to  the  ItAM  model,  this  direction  of 
research  appears  to  be  very  difficult.  In  particular,  it 
involves  decidinq  whether  P = NP.  Nevertheless,  it  is 
dcscribc?d  as  "promising"  because  it  deals  with  such  important 
questions.  Until  sharp  lower  bounds  arc  found  for  a variety 
of  problems  of  coml>inator i a 1 optimization,  tliere  will  bo  no 
clear  understanding  of  wliy  some  problems  arc  easy  and  otliers 
superficially  resembling  tliem  are  very  hard.  An  upper  bound 
on  the  intrinsic  complexity  of  a problem  can  bo  cstablistied 
by  designing  and  analyzing  an  algorithm  for  its  solution.  If 
tile  design  involves  some  ingenuity  and  the  analysis  is  done 
witli  care,  the  result  is  likely  to  be  interesting  and  pertiaps 
even  useful.  At  the  very  least,  it  is  likely  to  be  "non- 
trivial" in  an  intuitive  sense.  Nontrivial  lowei-  bounds  are 
much  liaider  to  come  by,  because  tiiey  require  consideration  of 
all  conceivaljle  algorithms  and  tlius  demand  a mucli  clearer 
understanding  of  t lie  underlying  logical  issues.  In  fact,  as 
Weide  (1*177)  remarks,  trivial  lower  bounds  are  often  the  only 
ones  available.  Consider,  (or  example,  the  problem  of  find- 
ing a longest  simple  path  bi'twi’en  two  given  node.;  of  a 
compli'te  undirecti'd  network  with  n nodes.  I’he  problem  is 

NP-h.ird  and  hence  probably  does  not  admit  a good  algorithm 

1 nAglhOO 

--not  even  one  of  complexity  0(n‘  ).  However,  the 

best  known  lower  bound  (with  respect  to  the  RAM  model)  is 

2 

the  trivial  one  of  d(n  ),  obtained  by  noting  tliat  none  of 
the  n(n-l)/2  arc  lengths  can  be  ignored  by  any  algorithm 
that  solves  the  (rroblem  for  all  sets  of  input  data. 


Weide  (1977)  discusses  a few  methods  that  have  been  used 
to  establish  lower  bounds,  with  respect  to  various  models  of 
computation,  on  the  complexity  of  combinatorial  problems. 

The  interested  reader  should  consult  Kerr  (1970) , Spira  and 
Pan  (1975),  and  Yao  et  al.  (1977)  concerning  shortest  paths, 
Spira  and  Pan  (1975)  and  Shamos  and  Hoey  (1975)  concerning 
minimum  spanning  trees.  Harper  and  Savage  (1972)  concerning 
maximum  matching.  Harper  and  Savage  (1972)  and  Rabin  (1972) 
concerning  shortest  tours.  Holt  and  Reingold  (1972)  concern- 
ing the  detection  of  circuits  or  connectedness,  and  Rivest 
and  Vuillemin  (1975)  concerning  the  detection  of  an  artitrary 
nontrivial  monotone  graph  property  (a  property  possessed  by 
some  but  not  all  graphs  and  preserved  by  addition  of  arcs) . 
The  most  promising  method  seems  to  me  to  be  the  one  of  Rabin 
(1972),  which  is  used  also  by  Spira  and  Pan  (1974)  and  Yao 
et  al.  (1977).  (The  application  to  shortest  paths  by  Yao 
et  al.  (1977)  is  apparently  invalid  because  it  depends  on  an 
incorrect  counting  argument,  but  the  paper  is  still  of  inter- 
est.) Some  other  papers  related  to  Rabin's  method  are  those 
of  Spira  (1972),  Klee  (1975)  and  Yao  (1976). 


B.  Optimization  in  Oriented  Matroids 

The  theory  of  matroids  is  a combinatorial  generalization 
of  the  theory  of  linear  dependence  in  a vector  space  over  a 
field,  and  of  the  theory  of  linear  equalities.  Similarly, 
the  theory  of  oriented  matroids  may  be  viewed  as  a combina- 
torial generalization  of  the  theory  of  positive  linear 


dependence  in  a vector  space  over  an  ordered  field,  and  of 
the  theory  of  inequalities.  The  notion  of  an  oriented  ma- 
troid  developed  from  the  digraphoids  of  Minty  (19G6)  through 

the  generalizations  of  Camion  (1968),  Fulkerson  (1968)  and 
Rockafellar  (1969) , to  the  present  axiomatic  versions  of 
Bland  (1974),  Las  Vergnas  (1975)  and  Lawrence  (1975).  A good 
basic  reference  is  Bland  and  Las  Vergnas  (1978). 

Realizing  a possibility  first  suggested  by  Rockafellar 
(1969),  Bland  (1974,  1976)  and  Lawrence  (1975)  show  the  basic 
parts  of  linear  programming  tlieory  can  be  extended  to 
oriented  matroids.  Both  have  oriented  matroid  formulations 
of  linear  programming  duality,  and  Bland  (1976,  1977)  also 
has  a finite  pivoting  metliod  wliich  specializes,  in  tire  con- 
text of  linear  programming,  to  Dantzig's  simplex  method  witlr 
a new  pivot  selection  rule.  The  study  of  optimization  in  t)ie 
framework  of  oriented  matroids  seems  to  offer  the  best  chance 
for  the  unification  of  large  parts  of  linear  and  combinato- 
rial optimization.  This  is  related  to  the  approacli  of  T.awlcr 
(1976,  Cliap.  9)  through  matroids  with  parity  conditions,  and 
it  is  important  to  clarify  the  relationships  between  the  two 
approaches . 


C.  Development  of  Algoritlnns 

A striking  aspect  of  combinatorial  mathematics,  and  of 
combinatorial  optimization  in  particular,  is  its  endless 
supply  of  interesting  problems.  To  become  convinced  that  the 
design  and  analysis  of  algorithms  for  combinatorial  optimiza- 


tion  will  continue  for  a long  time,  one  need  only  glance  at  a 
few  volume^  of  Annji^l^  of  Discrete  Mathematics , Information 
Processing  Letters , Journal  of  the  Association  for  Computing 
Machinery,  Mathematical  Programming,  Mathematical  Programming 
Studios,  Networks , Operations  Research,  Proceedings  of  ACM 
Symposia  on  the  Theory  £f  Computing,  Proceedings  of  IEEE 
Symposia  on  the  Foundations  of  Computer  Sci ence , or  the  SIAM 
i^urnal  on  Computing . Below  arc  described  some  directions  of 
algorithm  development  that  seeni  at  present  to  be  especially 
promising. 


Exploring  the  boundary  of  NP-completeness 

By  following  the  advice  in  the  excellent  chapter  of  Garey 
and  Johnson  (1973?)  on  using  NP-completeness  to  analyze 
problems,  designers  of  algorithms  can  help  to  "explore  the 
boundary"  of  NPC,  the  class  of  all  NP-complete  problems.  The 
general  idea  is  that,  having  established  the  membership  of  a 
problem  in  one  of  the  classes  P and  NPC,  one  should  not 
rest  before  finding  a closely  related  problem  in  the  other 
class.  For  each  of  the  problems  (1)  - (4),  Garey  and 
Johnson  (1978?)  describe  several  relatives  in  NPC,  and  for 
(5)  there  are  relatives  in  P.  The  reader  has  met  some  of 
these  relatives  earlier,  and  a few  more  are  described  below. 

The  Chinese  postman  is  a relative  of  the  traveling  sales- 
man. In  serving  his  route  he  must  traverse  each  street  at 
least  once,  one-way  streets  only  in  the  proper  direction,  and 


return  to  his  starting  point.  And  of  course  ho  wants  to 
minimize  the  distance  traveled.  The  postman's  problem  is  in 
P if  all  streets  are  directed  or  all  are  undirected  (Edmonds 
and  Johnson,  1973) , but  the  mixed  case  is  NP-completo 
(Papadimi triou , 1976). 

When  G = (N,A)  and  H = (V,E)  are  undirected  graphs, 
a perfect  H-pac!<  j ng  of  G is  a partition  of  N into  pair- 
wise disjoint  subsets  each  of  which  is  of  cardinality  lv| 
and  induces  in  G a subgraph  isomorphic  to  H.  For  fixed 
H,  the  problem  of  deciding  wiiether  G admits  a perfect 
H-pac)iing  belongs  to  P if  |Vl  fi  2 (use  matching)  but  is 
NP-complete  whenever  Ivl  a 3 (Kirkpatrick  and  Hell,  1978). 

Let  NPI  = NP  ~ (PuNPC) , the  class  of  all  problems  in 
NP  that  are  of  "intermediate"  difficulty  in  the  sense  that, 
even  though  they  can't  be  solved  by  polynomially  bounded 

algorithms,  they're  at  least  not  so  difficult  as  to  be 

NP-complotc.  Under  the  assumption  that  P * NP,  Ladner 
(1975)  proves  NPI  is  nonempty  but  no  one  has  produced  a 
member  of  NPI  that  is  of  practical  importance  or  intrinsic 
interest  aside  from  its  membersliip  in  NPI.  As  noted  by 
Garey  and  Johnson  (1978?),  linear  programming  is  a prime 
candidate.  A class  of  problems  that  seems  well  suited  to 

exploring  tlic  boundary  of  NPC,  and  perhaps  of  NPI,  is 

obtained  as  follows.  Let  F be  the  set  of  all  functions 
f ; •>■  Z^  such  that  f(k)  s k for  all  k t Z^,  the  sot 

of  all  nonnegative  integers.  For  each  f e F an  instance  of 
the  problem  ^resp.  Rj.)  is  associated  with  a graph 


G = (N,A) , a pair  (x,y)  of  distinct  members  of  N,  and  a 
sequence  S of  f(n-2)  distinct  member  of  N~{x,y}.  The 
problem  ^resp.  asks  whether  there  is  a simple  path 

from  X to  y that  uses  all  the  members  of  S in  some 
^resp.  in  the  specified)  order,  other  intermediate  nodes 
being  permitted  as  well.  Note  that  belongs  to  NPC 

when  f(k)  h k,  while  R^,  belongs  to  P whenever  k - f(k) 
is  bounded.  When  f = 0,  both  problems  belong  to  P.  When 
f i 1 and  G is  undirected,  both  problems  belong  to  P. 

(Add  a now  node  t,  new  arcs  {t,x}  and  {t,y}.  Lot  all 
arcs  have  capacity  1.  Solve  as  a maximum  flow  problem  with 
sink  t and  source  s,  where  S = {s}.)  What  happens  for 
other  choices  of  f? 


Approximate  algorithms 

Wo  saw  in  the  result  of  Sahni  and  Gonzalez  (1976)  that 
unless  P = l^P,  the  general  undirected  traveling  salesman 
problem  (5)  does  not  admit  a polynomial ly  bounded  algorithm 
that  is  e-approximate , no  matter  how  largo  e is  chosen. 
On  the  other  hand,  when  G's  arc  lengths  satisfy  the 
triangle  inequality,  the  O(n^)  method  of  Christofides 
(1976)  is  ^-approximate  and  it  may  oven  turn  out  that  for 
each  r > 0 this  special  case  of  (5)  admits  a polynomially 
bounded  c-appro.\imatc  algorithm.  In  short,  thougli  tlie 
NP-complete  problems  arc  computationally  equivalent  so  far 
as  solvability  by  polynomially  bounded  exact  algorithms  is 


concerned,  this  docs  not  apply  to  approximate  algorithms. 
Much  is  known  about  approximate  algorithms  but  mucli  more  re- 
mains to  bo  discovered.  Good  sources  of  information  are  the 
annotated  bibliography  of  Garey  and  Johnson  (1976)  , the 
survey  article  of  Korte  (1978) , and  the  book  of  Garey  and 
Johnson  (1978?) . 


Average-case  and  probabilistic  analysis 
of  algorithms 

In  Part  C of  Section  6,  three  sorts  of  algorithms  are 
lumped  together  as  "algorithms  with  good  average-case 
behavior."  Some  arc  precisely  that,  while  others  are  with 
high  probability  c (n) -approximate , where  f (n)  0 as 

n -*•  “>.  A period  of  considerable  activity  in  the  average- 
case  and  probabilistic  analysis  of  algorithms  for  combinator- 
ial optimization  is  now  under  way.  A basic  paper  is  Karp 
(1976)  , and  Weide  (1977)  also  has  a brief  introduction  to 
the  subject. 

The  papers  on  random  graphs  by  Erdos  and  Renyi  do  not 
treat  algorithms  explicitly,  but  they  contain  many  facts 
tJiat  are  essential  for  the  designer  of  algorithms  in  this 
area;  several  of  these  papers  arc  collected  in  Erdos  (1973). 

In  addition  to  these  and  the  papers  mentioned  in  Part  C of 
Section  6,  the  interested  reader  should  consult  Grimmett  and 
McDiarimid  (1975),  Walkup  (1977b),  Lueker  (1978),  Karp  (1978?) 
Cohen  et  al.  (1973?),  and  Klee  and  barman  (1978?a,b). 
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8.  WHAT  ARE  THE  BEST  SOURCES 
OF  FURTHER  INFORMATION? 

As  is  clear  from  our  frequent  reference  to  them,  the  two 
indispensable  books  for  the  study  of  combinatorial  optimiza- 
tion are  those  of  Lawler  (1976)  for  problems  in  P and  of 
Garey  and  Johnson  (1978?)  for  problems  in  NPC.  But  even 
they  cannot  cover  everything  in  detail,  and  many  of  the 
references  cited  by  them  or  in  our  list  of  references  are 
also  necessary  for  a thorough  understanding  of  the  field. 

We  close  by  mentioning  two  very  active  areas  of  combina- 
torial optimization  whose  omission  here  causes  regret. 

Though  both  are  very  important  for  the  solution  of  practical 
problems,  they  have  been  omitted  because  of  space  considera- 
tions and  because  of  our  decision  to  base  the  entire 
exposition  on  problems  (1)  - (5) . For  access  to  the  tremen- 
dous volume  of  material  on  Scheduling  (several  books, 
hundreds  of  research  papers) , a good  starting  point  is  the 
special  issue  of  Operations  Research  devoted  to  the  subject. 
It  is  listed  here  under  Florian  (1978). 

For  discrete  location  problems,  see  Krarup  and  Pruzan 
(1978)  and  their  references. 
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